ENTROPY AND DIFFRACTION OF THE fc-FREE POINTS 
IN n-DIMENSIONAL LATTICES 
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Abstract. We consider the fcth-power-free points in n-dimensional lat- 
tices and explicitly calculate their entropies and diffraction spectra. This 
is of particular interest since these sets have holes of unbounded inradius. 



Preface 

Peter A. B. Pleasants gave me (CH) an early draft of this paper already 
in 2006. After his untimely death in 2008, Michael Baake asked me to finish 
the manuscript. At that time, it already contained the entire calculation 
of the entropies. Moreover, Peter had planned two further sections, one on 
improved error terms and one on a model set construction including the sets 
in question together with an upper bound for the topological entropies that 
is intrinsic to the corresponding window. While the former is now included, 
the latter is still work in progress. Instead, the text now has one section on 
a variational principle and one on the diffraction of the sets studied here. 



1. Introduction 

In [5] the diffraction properties of the visible points and the fcth-power- 
free numbers were studied and it was shown that these sets have positive, 
pure-point, translation-bounded diffraction spectra with countable, dense 
support. The interest of this lay in the fact that these sets fail to be De- 
lone sets: they are uniformly discrete (subsets of lattices, in fact) but not 
relatively dense. The lack of relative denseness means that these sets have 
arbitrarily large "holes" and hence are not repetitive in the sense of [11]. It 
is of interest to ask for more precise information about the irregularity of 
these sets, and Lenz [12] has asked what their entropy is. 

There are two kinds of entropy commonly associated with arrays of sym- 
bols (of which subsets of lattices are a particular case): patch- counting en- 
tropy which is defined simply by counting patches and depends only on the 
adjacency relation between sites, not on any metric of the ambient space; and 
measure entropy which is defined in terms of the frequency of occurrence of 
patches in space. The patch-counting entropy is an upper bound for the mea- 
sure entropy, whatever measure is used. We show that the sets considered 
here have measure entropy zero (relative to a canonically constructed mea- 
sure) but positive patch-counting entropy, contrasting with regular model 
sets [IB] , for which both entropies are zero [I]. In [5], a model set construc- 
tion for the visible points and the fcth-power-free numbers was described, 
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with the internal spaces adelic, instead of Euclidean as in more usual cut- 
and-project sets. In this construction, the boundaries of the windows have 
positive measure, however, so they are not regular model sets. 

In Section [21 we define patch-counting and measure entropies, while in 
Section [3] we define the set of A;-free points, whose entropies we investigate, 
and show that they possess patch frequencies which can be explicitly calcu- 
lated in terms of infinite products. This is just a mild generalization to the 
case of lattices other than Z of the results of Mirsky [2] on fcth-power-free 
integers. To keep the route to our main results as clear as possible we have 
been content with weak error terms in Section [3l but for the record we show 
in Section [8] how error terms like those in [13] carry over to the general 
case. Section |U gives some examples of patch frequencies and Section [5] com- 
pletes the calculation of the entropies, with the aid of a key lemma (for the 
measure entropy case) that gives a small upper bound for the frequencies 
of the great majority of patches. In Section [6l we give a short discussion 
of the variational principle, which relates the two kinds of entropy. In Sec- 
tion [7J we demonstrate how the results in [5] on the diffraction spectra of the 
A;th-power-free integers and visible lattice points carry over to the general 
case. 

In the course of the paper, we need to call on a number of standard results 
in number theory, which for convenience we have collected in an appendix 
(whose equation numbers carry a prefix 'A'). 

2. Definitions of entropy 

Let X be a subset of a lattice A in W 1 . Given a radius p > and a point 
t £ A, the p-patch of X at t is 

(i-t)nSp(o), 

the translation to the origin of the part of X within a distance p of t. We 
denote by A(p) the set of all p-patches of X and by N(p) = \A(p)\ the 
number of distinct /9-patches of X. Then the patch- counting entropy of X is 

(1) WJ() := Um 

where v n is the volume of an re-dimensional ball of radius 1, i.e. v n = 
7r n / 2 /r(l + 5) (so that the denominator is the volume of the open ball 
B p (0)). It can be shown by a subadditivity argument that this limit exists 
for every X C A. In [H Theorem 1 and Remark 2] Baake, Lenz and Richard 
show that, for the dynamical system of coloured Delone sets of finite local 
complexity, the patch-counting entropy coincides with the topological en- 
tropy; see Section [6] for more on the natural dynamical system associated 
with a subset X o£ A and the fc-free points in particular. 

To describe measure entropy, we must take into account densities of sub- 
sets of a lattice. If Y C A, its density 5(Y) is defined by 

(2) XT) :- lim l Fn „ B *<°", 

W y J R-^oo R n V n 

when the limit exists; cf. [5] for related ways of defining densities of discrete 
point sets. In cases where the limit does not exist, we can still define an 
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upper density, S(Y) and a lower density, 5(Y), by replacing the limit in ([2]) 
by limsup or liminf. The frequency, of a p-patch V of X is defined 

by 

(3) v(V) := 6({t G A : the p-patch of X at t is V}), 

when this density exists. In the absence of a well defined density, we can 
still define an upper frequency, v(V) and a lower frequency, v(V), by replac- 
ing 5 by d or 5. The measure entropy of X, which can be thought of as 
corresponding to the metric entropy of a dynamical system, is now defined 
by 

(4) h meas (X) := lim V log 2 

with the convention that v log 2 ^ = when v = 0; see Section[6]for details. It 
is defined when every patch of X has a well defined frequency, in which case 
a subadditivity argument again shows that the limit exists. Since v log 2 v is 
a convex function of v, the sum does not decrease if we replace the ^("P)'s 
by their average value, 1/N(p), to make the right side the same as the right 
side of (HJ). Hence 

As a simple example where these entropies differ, consider the binary 
sequence consisting of the binary numbers in order (0, 1, 10, 11, 100, . . . ) 
with n, n + 1 separated by n l's: 

0111011111111001111101111111101111111111111111 1000 1111 .... 

Evidently, there are very few 0's to contribute variety here. In fact the 
sequence of 0's has density zero, and consequently any finite word that is 
not all l's has frequency zero. So h meas = 0. But since there are 2 l possible 
words of length I and every word occurs somewhere, h pc = 1. 

In general, h pc is a combinatorial function of the set of finite configurations 
that occur, while h meas is a geometric function of an infinite configuration 
and can differ among different infinite configurations built up from the same 
set of finite ones, with h pc being an upper bound for the possible values it 
can take. Of the two entropies, h meas would appear to carry more physical 
significance. 

More generally, if we have a pattern formed by labelling the points of A 
with letters from an o-letter alphabet then we can again define p-patches, 
and the entropies of the pattern are given by ([1]) and with 2 replaced 
by a as the base of logarithms. A subset of A corresponds to a 2-letter 
labelling indicating whether or not a site is occupied. The reason for the 
patch volume in the denominator and for the choice of base of logarithms 
is to normalize so that the integer lattice with random labelling has both 
entropies 1. 

There are various ways in which the definition of measure entropy might 
be extended to sets X for which not all patch frequencies exist. A first step 
would be to replace the sum in by 

Hm ^ \L(V)nB R (0)\ ( \L(V)nB R (0)\ \ 

R ^ P iX( P ) RnVn ^ RnVn ' 
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where L(V) is the set appearing in ([3]). This delays taking the limit, so 
that it has a chance of existing even when some individual patch frequencies 
may fail to exist. We shall not need such extensions here, however, since 
Theorem [1] below guarantees that, for the sets studied in this paper, all 
patch frequencies exist. 

3. /c-FREE POINTS 

As a convenient context for our results, we shall use the set V = V(A, k) 
of /c-free points of a lattice A in W 1 . For a point i / in A define its 
k-content, cjt(i), to be the largest integer c such that I G c k A. Then c&(£) 
is also the least common multiple of the numbers d with d~ k l € A, i.e. 
dT k l € A if and only if d | Cfc(Z). For consistency and convenience, we define 
Cfc(O) = oo, with the understanding that d | oo for any number d. The 
k-free points, V = V(A, k), of A are the points with Cfc(Z) = 1. One can see 
that V is non-periodic, i.e. V has no nonzero translational symmetries. As 
particular cases we have the visible points of A (with n > 2 and k = 1), 
treated in [5J, and the /c-free integers (with A = Z), treated in [3], [p3] and 
|14| . The more general context has the advantage of avoiding duplication of 
near-identical proofs. When n = k = 1, V consists of just the two points of 
A closest to on either side, and we exclude this trivial case. Since A is a free 
Abelian group of rank n, its automorphism group, Aut(A), is isomorphic to 
the matrix group GL(ra, Z). Explicit isomorphisms can be found by taking 
coordinates with respect to any basis of A. Since the action of GL(n, Z) on 
A preserves /c-content, the /c-free points V are invariant under the action of 
GL(n,Z). 

Proposition 1. V is uniformly discrete, but has arbitrarily large holes. 
Moreover, for any r > 0, there is a set of holes in V of inradius at least r 
whose centres have positive density. 

Proof. Since V C A, the uniform discreteness is trivial. Now let C = 
{a\, . . . ,a s } be any finite configuration of points in A (e.g., all points in 
a ball or a cube). Choose s integers mi, . . . ,m s > 1 that are pairwise co- 
prime (e.g., the first s primes). By ()A10p . there is a point a £ A with 

a = —ai (mod m k A) 

for i = 1, . . . , s. Now for any x = a (mod m\ ■ ■ ■ m k K) the configuration 
C + x = {ai + x, . . . , a s + x} is congruent, in the geometric sense, to C but 
no point in C + x is in V, since + x £ m k A for i = 1, . . . , s. The points 
x have density l/((mi • • • m s ) nk det(A)) > by and (|A10l) . □ 

For a natural number P, we define Vp = Vp(A, k) to be the set of points 
I £ A \ {0} with (cfc(Z), P) = 1. Clearly Vp is fully periodic with a lattice of 
periods that contains P k A. The Vp's are partially ordered inversely to the 
divisibility partial order on N, that is, Vpq C Vp for all P, Q. In fact, more 
precisely, Vpq = Vp Pi Vq. The intersection of all the Vp's is V, so if P is 
divisible by all primes up to a large bound Vp can be regarded as a set of 
"potentially /c-free" points. 

For a finite subset J 7 of A and a positive integer m, we shall use 

J/mA 
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to denote the set of cosets of mA in A that are represented in T . We also 
write 



for the diameter of J- . 

Since entropies of sets in M n vary under change of scale inversely as the 
nth power of the scaling constant, it is sufficient to consider lattices of 
determinant 1. (For other lattices the formula for the entropy of V must 
simply be divided by the determinant of A.) We fix the following notation 
for the rest of this paper: 

A is a lattice of determinant 1 in 1", A is the length of its 
shortest nonzero vector, k is a natural number (with k > 2 if 
n = 1) and V is the set of k-free points in A. 

Also, for subsets X, V, Q of A, with X infinite but V, Q finite, we define 
the locator set 



consisting of those lattice translations that locate V totally inside X and Q 
totally outside X. 

The genesis of our proof of positive, but non-maximal, patch-counting 
entropy for the visible points is the observation that, of the 4 corners of 
any unit square of the integer lattice in the plane, at least one is invisible 
(because both its coordinates are even) but each of the 15 possibilities for 
the visibility or not of the corners, when the possibility of their all being 
visible is excluded, can occur, depending on the position of the square within 
the lattice. This is the simplest example of the fact that, in general, every 
p-patch contains an irreducible minimum of points not in V but for the 
remaining points in the patch we can arrange that they are visible or not, 
independently of each other by choosing the position of the patch in the 
lattice. This leads to an exponentially large number of p-patches, the number 
of which can be estimated quite accurately. 

Our aim with the following lemma is to concentrate most of the necessary 
inclusion-exclusion arguments into a single result from which ensuing results 
can be fairly readily derived. For this reason it has several parameters (V, m, 
m, P and x) and three components to its error term. Until the parameters 
are further specified, there is no assumption that the error terms are of 
smaller order than the main term. To keep the proof short we have not 
made the error terms as small as possible — in Section [5] we make use of the 
technique of [13J to vastly improve the last error term. 

Lemma 1. Let V be a finite subset of A, m G N, m £ A, P be a natural 
number coprime to m and x E W 1 . Then 



D{J-) := max |Z — m 



L{X- V, Q) := {t E A : V + t C X, Q + t C A \ X} 



\L(V P ;V,$) n (m + mA)nB R (x)\ 



is estimated by a main term 
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with error 

(6) o(i? 1 / fc + i? n - 1 (min{loglogP,logS})l p l+min{T| 7 ,| +1 (P),(5/A)l p l/ fc }), 

where S := R + \\x\\ + max pg p ||p||, r r is the r- divisor function in (|A6j) . and 
the O-constant depends only on A, k and V . 

Proof. We may clearly assume that P is squarefree. For each prime p the 
points t with t + V C V p consist of p nk — \V/p k A\ cosets of p k A in A (those 
cosets t + p k A with (-t + p k A) n "P = 0). Clearly \V/p k A\ = \V\ when 
p fc A > D(V). Let Q be the product of those prime factors p of P with 
\V/p k A\ < \V\. By the Chinese Remainder Theorem (jA"lOj) . L(Vq;:P,0) n 
(m + mA) consists of 

\{(p nk -\V/p k A\) 

v\Q 

cosets of mQ k A in A. For each such coset q + mQ k A we have 

(7) (q + mQ k A) n L(y P ; 7>, 0) = (g + mQ fc A) n L(^p/ Q ; P, 0). 

Now write V = . . . ,p r }. Since Pi + t E Vp/Q if and only if Cfe(pj + 1) 
is coprime to P/Q, it follows from (|A2|) that for each of these cosets the 
cardinal of L(V P ; V, 0) n (g + mQ fc A) n ^(a;) is 

e n e M- 

t<=ADB R (x) i=l d\P/Q 
t-qemQ k A d\c k ( Pi +t) 

Reversing the order of summation gives 

(8) y, E E •••*) E L 

dl|P/Q o!2|P/Q o!r|P/Q teAnB fl (a;) 
< 5/A for each i t£q+mQ k A 

te- Pi +d k A 

where replacing fJ,(d±) ■ ■ ■ fi(d r ) by n(d% ■ ■ ■ d r ) is justified by the fact that 
the diS are pairwise coprime since any common factor of Ck{Pi + t) and 
c k(Pj + t) divides Ck(p i — pA all of whose prime factors divide Q. Writing 
d\ • • • d r = d and noting that (mQ, d) = 1, we can apply (|A9p with A replaced 
by m(dQ) k A to obtain, for the inner sum, the estimate 

Rli.. 

(9) + 0{R^/m-\dQt-^) + 0(1). 

Substituting this estimate in ([8]) gives a main term 

R n v n 



n i-i 



mnQnk p \p)q^ pn 

with error term ©. The main term arises by removing the conditions 
d\ < S/X from the sum of the main term in ([9]) then using the fact that 
fi(d)T r (d) (where T r (d) is the number of ways of expressing das a product of 
r natural numbers) is a multiplicative function, whose value is — r at primes 
and at prime powers, to express the extended sum as an Euler product, 
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as in (|A8j) . The first error term in ((6]) comes from the extra terms included 
in the extended multiple sum, so is 



<I^l y J_y ±....y ±. = o(R n s- n+ ^ 

- m nQnk ^nk ^nk ^nk \ 

d\>S/\ 1 rfa=l 2 dr=l r 



since each of the r — 1 complete sums is < ("(2) < 2. The other two error 
terms account for the sum over the error terms in ([5]). The logarithms in 
the middle error term are necessary only in the case n = 2, k = 1, when the 
series Yl ^ ^™ diverge but the partial sums can be estimated by using 
(|A7j) or the standard estimate for the partial sums of the harmonic series. 
In all other cases these series converge and the middle error term can be 
taken as 0(P n_1 ). (When n = 1, there is no middle error term, since the 
first error term in ([9]) is then the same as the last.) 

Finally, summing over the cosets of mQ k A that make up L(Vq;V,$) n 
(m + raA) gives the main term ([5]) (since \P/p A| = r when p \ Q) and 
increases the error term by a factor at most Q k , which is bounded in terms 
of k and V. □ 

Corollary 1. If p is a positive radius and P is a natural number divisible 
by every prime less than log p, then 

(10) \v P nB p (o)\ = ^- )+ o( P n ), 

(11) |yn ^ (0)l = |S) +o( ^ ) ' 

(12) \(V P \V)nB p (0)\=o(p n ) 
and, for any x 6 M. n , 

(13) |Vpn^(a0l<^ + O (p n ), 
where £ is the Riemann £ -function. 

Proof. For (|10p we use the lemma with V = {0}, m = 1, x = and R = p, 
then replace the product by l/((nk) using (|A4p . with = logp, and ()A3p . 
This gives ([TO]) with error term 0(p n / log nfc_1 p). 

Clearly Vq D B p (0) = V (1 B p (0) when Q is the product of all primes less 
than (p/A) 1//fe (where A is the length of the shortest nonzero vector in A), 
giving (fTT|) . and (fT2|) results from subtracting this from (fT0|) , 

For (fT3|) we use the lemma with P replaced by P', the product of the 
primes less than log p, together with (|A4|) and ()A3p , and note that Vp C Vp> . 
Then 

IVpnP^)! < \Vp,nB p (x)\ = ^- ) +o{ P n /io g nk - 1 P ), 

since logP' = O(logp) and hence log log P' and r(P') are both O(/0 e ) by 
and (15511. □ 
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We note that (jlip tells us that V has density 1/C(nk), generalizing Propo- 
sitions 6 and 11 of [5 J (though the error terms are not as good as those in |5j 
and much worse than those in p3] and |13j). Also, one might regard (|13p as 
saying that V has a "uniform upper density" (or that A \ V has a uniform 
lower density). 

The following two theorems carry over to /c-free points the results of 
Mirsky [13|. 114] ( |13| improves the error terms in [14jjj on A;- free numbers. 
A weaker result for squarefree numbers goes back to Pillai [18]. Again, we 
make no attempt in Theorem [1] to match the error term of [13J . postponing 
this to Section [HJ 

Theorem 1. For any two disjoint finite subsets V and Q of A, L{V\V, Q) 
has a well defined density given by 

J: , 



E(-^'n(i-^^). 



Proof. By the inclusion-exclusion principle (|A1[) applied to L(V;V,$) H 
B R (0), with Pi being the property that Qj + t £ V (where Q = {q 1 , q 2 , ■ ■ •}), 
we have 

\L(V; V, Q) n B R (Q)\ = ^ (-1)^ \L(V; PUJ,8)n B R (0)\. 

Now Lemma [1] with P equal to the product of the primes less than log R 
gives the estimate 

p<io g i?A p ' 

for |L(V P ;7 ? UJ : ,0)n B R (0)\, the proof of <g2§ of Corollary rj shows that V P 
can be replaced by V at the expense of an extra error term 0(R n / (log R) nk ~~ l ) 
and (|A4p allows the product to be extended over all primes with a similar 
extra error term. Altogether, this gives the estimate 

R«V n £(-l) |Jr| Ilfl- l{VUJ nl /pkM ) +0(^7(l0gfir fc - 1 ) 

iox\L(V;V,Q)nB R (0)\. □ 

Theorem 2. For disjoint finite subsets V and Q of A, the following state- 
ments are equivalent: 

(i) \P/p k A\ < p nk for every prime p; 

(ii) L(V;V,Q) is non-empty; 

(iii) L(V A ;'P, Q) has positive density. 

Proof. Clearly (iii) implies (ii) and, almost as clearly, (ii) implies (i), since 
if V contains a complete set of coset representatives for p k A then, for every 
t G A, some point of V + 1 is in p k A (so not in V). 



■*"As pointed out by Jorg Briidern, the work of Tsang [2T] can be extended to the case 
of fc-free numbers and gives a further small improvement. 
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Now assume (i) holds. For each g G Q choose a different prime p(q) > 
(D(V U Q)/A) 1//fc and let m be the product of the p(g)'s. By the Chinese 
Remainder Theorem, there is an m G A such that 

t = m (mod m k A) -<=^ t = —q (mod p(q) k A) Vg G Q. 

Then for t = m (mod m k A) we have Q + t C A \ V and P + tcV m (the 
latter using the fact that for every p G V and every prime factor p(q) of m, 
q + t G p(q) k A and ||p — g|| < p(q) k X, ensuring that p + t p(q) k A). Now 
Lemma [H with P the product of the primes less than log R not dividing 
m, gives a main term Ci? n with error 0(max{i?™ _1 log -R, i? 1//fc }) for the 
cardinal of a subset of the points i £ An -Br(O) with V + t C y m p and 
Q + t C A \ V, where the constant C is positive since the product in §5§ 
has every term positive. By (|12p of Corollary [H the number of these points 
with V + 1 (£ V is o(R n ). Hence L(V;V, Q) has positive lower density, and 
so, by Theorem [H positive density. □ 

An interesting feature of Theorem[2]is that the criterion (i) is independent 
of Q. This means, for example, that 

L(V; P, 0) ^ ^ S(L{V; V, Q)) > VQ with V n Q = 0, 

which tells us, in particular, that every subset of a patch of V is a patch of 
V. 

4. Examples 

Theorem [T] allows us to calculate the frequencies of p-patches of V in 
terms of the products 

for r = 0, 1, . . . , | A n B p (0)\. Here, we give two simple examples that both 
have nk = 2 and that have |A n B p (0)\ = 3 and 5, respectively. So we need 
the products 

^(2) = 1/C(2) = 6/vr 2 = 0.6079271 . . . , 

n 2 (2) = 0.3226340 . . . (the Feller-Tornier constant), 

n 3 (2) = 0.1254869. .. , n 4 (2) = 0.3785994 ... , n 5 (2) = 0.2733455 ... , 

whose values can be calculated efficiently by the method described in |17| . 

Our first example is to find the frequencies of all 2-patches when V is the 
set of squarefree numbers. Here A = Z, n = 1, k = 2 and |A n 52(0) I = 3. 
Since —1,0,1 are distinct mod p k , for every p, \(V U F)/p k \ = \V U F\ 
and ^(V) depends only on \V\ in this case. Table [T] gives the frequencies 
of 2-patches of all possible sizes, both in terms of the above products and 
numerically, and Figure [T] depicts the patches themselves, with their fre- 
quencies. There are 3 patches each of sizes 1 and 2, and we check that the 
sum, X^K? 3 )' °f the frequencies of all patches is 1 and that the average 
patch size, Yl K^)!^!) i s 3<5(V) = 18/-7T 2 . The patches of size 2 are the most 
frequent, as is to be expected since 2 is the closest integer to 35 (V): indeed, 
59% of all locations have patches of size 2. The empty patch is by far the 
rarest, occurring at less than 2% of locations. The radius p = 2 is the largest 
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\v\ 


v{V) 


Numerical value 


3 






n 3 




0.125486980905... 


2 




n 2 (2) 


-n 3 




0.197147118033... 


1 


Hi(2) 


- 2n 2 (2) 


+ n 3 




0.088145884881 . . . 





i - 3ni(2) 


+ 3n 2 (2) 


-n 3 


:2) 


0.018634010349 . . . 



Table 1. Frequencies of the 2-patches of the squarefree numbers. 



• • • 

v = 0.12549 

• • o 

v = 0.19715 



• o • 

v = 0.19715 

o • • 

v = 0.19715 



o • o 

v = 0.08815 

• o o 

v = 0.08815 



o o o 
v = 0.01863 

o o • 

v = 0.08815 



Figure 1 . The 2-patches of the squarefree numbers (A = Z, 
k = 2) with their frequencies accurate to 5 decimal places. 
The black dots are points of V and the open circles other 
lattice points. The top row contains the patches with mirror 
symmetry and the bottom row the two mirror image pairs. 



for which every subset of An B p (0) occurs as a patch of V: of the 32 subsets 
of A n -B 3 (0) the 3 that contain 4 or 5 consecutive points do not occur as 
patches of V. 

Our other example is the \/2-patches of the visible points, V, in Z 2 , where 
A = Z 2 , n = 2, k = 1 and |An -82(0)! = 5. Figure [2] shows the different 
patches, up to symmetry, with their frequencies. The four patches in the 
top row have the full dihedral symmetry D^; the two in the second row have 
symmetry D2, and give rise to another patch on rotation through vr/2; the 
remaining six patches have only reflection symmetry, and each gives rise to 
three others on rotation through ±7r/2 and ir. We can again check that 
X>(7>) = 1 and X>(7>)|7 ? | = 56(V) = 30/vr 2 . This time, however, the 
frequencies do not depend only on \V\, and indeed no two patches that are 
not symmetry related have the same frequency. Of the five patches with 
\V\ = 4, the symmetric one has frequency nearly 5 times that of each of 
the other four, and the ratio of the frequencies of two of the patches with 
\V\ = 3 is nearly 30. Surprisingly, one of the patches with \V\ =3 (the 
commonest patch size) has frequency smaller than that of any patch except 
the empty one. The empty patch itself occurs at less than 1 in 900 locations. 
As in the previous example, y/2 is the largest radius for which every subset 
of A n B p (0) is a patch: of the 512 subsets of A n B yg(0), the 135 that 
contain all four vertices of a lattice square do not occur as patches of V. 



5. Entropy calculations 
Theorem 3. h pc (V) = 1/C(nk). 
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Figure 2. The v^-patches of the visible points of Z 2 , up to 
symmetry, with their frequencies, accurate to 5 decimal 
places. 



Proof. For each radius p > let P = P(p) be the product of the primes p 
with p nk < | A n B p (0)\. Then P is divisible by every prime less than log p 
when p is large enough. 

The p-patch of V at any point t G A is a subset of (Vp — t) PI B p (0) and 
by (|13p of Corollary [1] the cardinal of this set is at most p n v n / Q(nk) + o(p n ). 
Also there are at most P nk possibilities for Vp — t as t varies, since P k A is 
the lattice of periods of Vp. So 



log^Go) < 



(14) 



< 



P Vn 

C(nk) 

P n v n 
C(nk) 



+ o(p n ) +nklog 2 P 
+ {p n ) + 0{p 1 / k ), 



since log 2 P = Oip 1 ^), by (JA5]) . 

To bound log 2 N(p) below we note that every subset V of V Pi B p (0) is 
the p-patch of V at some point of A, by Theorem [2] with Q = Af]B p (0) \ V. 
By (HI]) of Corollary U \ V Pi B p (0)\ = p n v n /C{nk) + o(p n ), so 



(15) 
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On dividing by p n v n and letting p tend to infinity, (Q3|) and (fT5j) give 
h pc (V) = l/((nk). ' □ 

To bound the measure entropy we need the following lemma, which en- 
ables us to obtain good upper bounds for the frequency of "sparse" patches 
of V, i.e. patches that contain few points in comparison to their size. 

Lemma 2. Let V and Q be disjoint finite subsets of A, let Q be the product 
of all primes p with 

(16) |S//A|<|S| 
and define 

s:= min \(Q + t)nV Q \. 

Then 

(17) 5(L(V;V,Q)) = 0{A^ D ^' x ^ /knk /\Q\ s ~ s / nk ) , 
where the O- constant depends only on A. 

Proof. If t G L(V; V, Q) then for each q G Q n (Vq — t) there is a prime 
p(q) \ Q with q + t G p(q) k A, and by the definition of Q these primes are 
distinct. By (fT2j) of Corollary Q] with p = R+max q( zQ \\q\\ and P the product 
of the primes less than logp, the number of points t G L(V;V, Q) n -Br(O) 
for which > logp for some q G Q Pi (Vq — t) is o(p n ). The remaining 
i's in -L(V;-P, Q) n 5^(0) have p(qr) < logp for each q G Q D (Vq — t). 
For the number of such t with a given set Q n (Vq — t) = . . . , q t } and 
a given ordered set of primes {p(qx), ■ ■ ■ ,p(lt)}, (|A9|) with A replaced by 
(p(<Zl) " " ~P( ( lt)) k A-' gives the estimate 

CR n 

~ (p(Qi)---p(Qt)) nk 

when R is large enough to ensure that R n > (log p) nk \Q\ , where the constant 
C depends only on A. The sum of this over all sets of t primes not dividing 
Q is majorized by 

ar(rJ-) < cR n ( V 1 ^ rR " 



< 



/ ■> rink J I / j m nk I \Q\ s — s / n k^ 

\p\Q / Vm^lQp/nfc / 

since t > s and the least prime not dividing Q is > |Q| 1/nfc , by (US}. There 
are at most Q nk possibilities for Q n (Vq — t), since Q fc A is the lattice of 
periods of Vq, so 

|L(y;^Q)n^(0)| < ^^- k +o(Rn 

AD(Q)/\) 1 / k nk r ir>n 

^ _|_ (f> n \ 

lQls~s/nk ^ ' 



for large R, where the second inequality results from (]A5j) and the fact that 
p k < D{Q)/^ for every prime factor p of Q. The result follows on dividing 
by R n and letting R tend to infinity (the existence of the limit on the left 
being guaranteed by Theorem [1]). □ 
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Theorem 4. h mcas (V) = 0. 

Proof. Given p > and a p-patch V of V, let Q := (An -B p (0)) \ 7> and, 
as in Lemma [2J define Q = Q(V) to be the product of all primes p with 
\Q/p k A\ < \Q\ and 

s = s (py- min |(Q + t)nVQ|. 

By (fT3j) of Corollary Q] and the fact that V C Vp with P the product of 
primes less than log p, we have 

(18 > l c l > ( 1 -cR))""""-° ( "") >! |f 

for large enough p. 

Now put 5 = S(p) := p n /y/\og 2 p. We shall calculate separately the 
contributions to the measure entropy h meas of the p-patches "P of V with 
s("P) > <5 and those with s{V) < S. The former patches have small fre- 
quency and the latter are few in number. 

For the p-patches with s(V) > S, Lemma [2] and (|18p give 

-log 2 v(P) > ^S\o g2 p-0{D(Qf/ k ) = \s\og 2P -0{pV k ) > ^£ 

for large enough p which, since — log 2 f is decreasing but —v\og 2 v is in- 
creasing for v G (0, 1/e], gives the estimate 

-v{V) log 2 v{V) = 0(2- slo ^^Slog 2 p). 

Since there are at most 2' AnBp (°)' p-patches in all, the contribution of the 
p-patches V with s(V) > S to the sum on the right of dH) is 



(19) 0(2l An ^(°)l-( 51o ^P)/3 < Slog 2 p) = 0(2- £ rv / ^ p ™yi ^ 2 7) = (1 



Turning to the p-patches with s(V) < S, denote this set of patches by 
B C A(p) and let F be their combined frequency. The contribution of these 
patches to the sum on the right of (JU) is 

which, since v log 2 v is a convex function of does not decrease if we replace 
the v(Vys by their average value, F/\B\. So this contribution is 

(20) <Flog 2 |£|-Flog 2 F<log 2 |S| + i^. 

To bound \B\ we note that if V £ B then there is a t € L(y\V, Q) with 
\Q n (Vq - t)\ < S. Since VcV-tcV Q -t, Q = (An B p (0)) \ V, and 
Q k A is the lattice of periods of Vq, V and Q are completely determined by 
this subset of B p (0) and by t modulo Q k A. There are Q nk cosets of Q k A 
in A and the number of subsets of A n B p (0) with fewer than S members is 
bounded above by 

| (' A n f <°») < ( L SJ + 1) ( |A n L fj (0)l ) < 25 («) S 
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for large p, by (jAlip . Hence the bound on the right of (|20p is majorized by 
51og 2 (e|A n B p (0)\/S) + log 2 2eS + nk\og 2 Q 

(M, = o { fOStei) + 0(l„ g2P ) + (p'A) = c(n 

V V lo S2 9 ) 

Since the contributions (|19p and (f2"Tj) are both o(p n ), h meas (V) = 0. □ 

Note on patch shapes. On the general principle of the isotropy of space, 
we have used spherical patches throughout and measured densities and fre- 
quencies through expanding spherical regions; but the results we obtain 
are independent of the shapes of these patches and regions: all our point- 
counting estimates stem from ()A9p which remains valid for an arbitrary 
expanding region in place of the expanding ball, with main term the volume 
of the region (using the volume of the fundamental region of the lattice as a 
unit) and an error term of smaller order provided the boundary of the region 
has n-dimensional measure zero. It is not even necessary for the shape of 
the density-defining regions to be the same as the (also expanding) patch 
shape. 

6. Variational principle 

Endowing the power set {0, 1} A of the lattice A with the product topology 
of the discrete topology on {0, 1}, it becomes a compact topological space 
(Tychonov's theorem). This topology is in fact generated by the metric 
where two subsets of A are e-close if e is the infimum of values s > with 
the property that they agree on B 1 / s (0). Let X be a subset of A. The closure 
X(X) of the set of lattice translations t + X (t G A) of X in {0, 1} A gives 
rise to the natural dynamical system (X(X),A), where X(X) is a compact 
topological space on which the action of A is continuous; cf. [4J and references 
therein for details. Denote by .M(X(X), A) the set of A-invariant probability 
measures on X(X) with respect to the Borel cx-algebra on X(X), i.e. the 
smallest cr-algebra on X(X) which contains the open subsets of X(X). For 
a fixed such measure p and a radius p > 0, let h p (p,) be the entropy of \x 
restricted to A(p), i.e. 

VeA(p) 

where A(p) denotes the set of p-patches of X and C-p is the set of elements 
of X(X) whose p-patch at is V, the so-called cylinder set with respect to 
V . The metric entropy of p is then given by the limit 



h(p) := lim 



h p (p) 



p->oo p' l V n 

which exists by a subadditivity argument; cf. [6] and also see |10j . As in 
Section [21 replacing the p(C-p)'s by their average value, 1/N(p), we see that 

h(p) < h pc {X) V/i G M(X(X),A). 

Since the topological entropy h top (K(X), A) of (X(X),A) coincides with 
h pc (X) by [H Theorem 1 and Remark 2], the variational principle for lattice 
actions on compact spaces here reads as follows; cf. [6j and [19j Sect. 6], 
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the latter being an extension of the case n = 1 from [22]. An elementary 
proof can be found in [13] . 

Theorem 5 (Variational principle). 

sup h(ji) = h pc (X). 

/J6M(X(X),A) 

Moreover, the supremum is achieved at some measure. □ 

In case of V, X(V) will also contain the empty set (cf. Proposition [1]) and 
various other subsets of A and thus admits many A-invariant probability 
measures. Moreover, h pc (V) = l/£(nk) by Theorem [3] Consider the fre- 
quency function v from above which gives the frequencies v(V) of occurence 
of /9-patches V of V in space. The function v, regarded as a function on the 
cylinder sets by setting v(C-p) := v{P), is finitely additive on the cylinder 
sets with v(K(V)) = YlveA(p) v ^Pv) = !• Since the family of cylinder sets is 
a (countable) semi-algebra that generates the Borel cr-algebra on X(V), one 
can use the method from [22\ §0.2] to show that v extends uniquely to a prob- 
ability measure on X(V). Moreover, this probability measure can be seen to 
be A-invariant. This shows that the measure entropy h meas (V) is indeed a 
metric entropy of a A-invariant probability measure on X(V). Certainly, an 
explicit characterisation of X(V) together with its A-invariant probability 
measures and corresponding metric entropies (in particular those measures 
H £ .M(X(V),A) with maximal entropy, i.e. /i(/x) = l/((nk)) would be 
desirable (but not simple). 

7. Diffraction spectrum 

In the following, we assume that the reader is acquainted with the math- 
ematics of diffraction as carefully laid out in [SJ; see also [2] and references 
therein for a review. We shall also use the notation and results from that 
text. In fact, the proofs presented below are straightforward modifications 
of the corresponding proofs in [5] and are only included for the reader's con- 
venience. For an alternative derivation of the diffraction spectrum in case 
of the visible lattice points, see [201 Sect. 5a]. 

A Dirichlet series we shall encounter below is 

m=l P 

which is absolutely convergent for 3?(s) > 1. 

Theorem 6. The natural autocorrelation of V exists and is supported on 
A, the weight of a point a £ A in the autocorrelation of V being given by 




with error term equal to 0(R~( l ~^ l l k ^ 2 ) for n = 1 and k > 2, 0(R^ 1 ^ 2 ) for 
n = 2 and k = 1 and 0(R^ 1 ) otherwise, where, in any case, the implied 
constant depends on a as well as on A. (For lattices A with determinant 
7^ 1 the weights above must be divided by det(A)J 
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Proof. Since the cases n = 1, k > 2 and n > 2, k = 1 were already treated 
in Theorems 1, 2 and 4], we may assume that n, k > 2. Since V — V C A, 
the autocorrelation of V (if it exists) can only be supported on A. The 
weight of a point a £ A in the autocorrelation of V is the limit as R — > oo 
of 

(23> ik £ 1 

a;,a:-aeVTlB fl (0) 

and, by [3 Lemma 1], the existence of this limit for each a £ A is sufficient 
to ensure the existence of the autocorrelation. 

It is convenient to drop the condition x — a G -Br(O) in (|23p . which then 
becomes 

(24) -L- y i. 

a;,ai— agV 
a3eBjj(0) 

The difference between these sums is 0(1 /R) by (|A9j) . due to the extra 
lattice points x within a constant distance ||a|| of the boundary of -Br(O) 
that are included in the latter. By (|A2p . this can be written as 

a;6AnBij(0)\{0,a} l\ci-(x) m\c k (x— a) 

Reversing the order of summation gives 

E E a*(0mh e l 

" l<i<5* l< m <5* i =eAnB fl (0)\{0,a} 

a;ez fe A 

x— o6m fc A 

where S 1 := ||a||)/A. Collecting terms with the same value of d = (l,m), 
noting that all x in the inmost sum belong to d k A and that there is no such 
x unless a G d k A, and putting I' := l/d, m' := m/d, x' := x/d k , a' := a/d k , 
we obtain 

( 25 ) E E E Ki'dMm'd) 1 - 

n d\c k (a) 1 < ; , <s i /d 1 < m/<S l /d x'eAnB R/dh (0)\{0,a'} 
(l',m')=l x'el' k A 

Since I' and m! are bound variables of summation and x' and a' will not be 
referred to again, we can drop the dashes: from now on I and m are the new 
I' and ml but a is the original a. 

By (|A9p with A replaced by (lm) k A, the inmost sum is 

R \ n „/ i? xn_1 



(dlm) k J \(dlm) 

These three terms give a main term and two error terms in (|25 j) . 
The first error term is majorized by 

\ E i Jk^I) E i ^iry I = 0(1/R) 

KKS* Km<SH 
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since the sums are convergent due to k(n — 1) > 2. 
The second error term is majorized by 

P)= (fw) = °"» 

since S = 0(R) and k(n — 1) > 2. So both error terms are 0(1/ R) and thus 
tend to as R — > oo. 
The main term is 

Ev - >. - fi(ld) fi(md) 

2_y (dlm) nk 

d\c k (a) 1 < |<SI ^ i< m< s%/d 
(l,m)=l 

Ev - >. - /_i(ld)fi(md) 

2-^ (dlm) nk 
d \°k( a ) iKKS^/d l<m<S^/d 
(l,d)=l (m,d)=l 
(i,m)=l 

/i 2 (d) x - x - /i(Zm) 



(l,d)=l (m,d)=l 

since /i is multiplicative and /j,(lm) = when (l,m) ^ 1. Since the last 
double sum is absolutely convergent, this converges to 

(26) y- V 2 ( d ) y^ K r )i~{r) 

d|c fc (o) r=l 
(r,d)=l 

as i? — t- oo. The difference between this limit and the partial sum above is 
0(1/ "R**- 1 ), so falls within the error estimate 0(1/R). 

Using (|22p. the expression for the limit (|26p can be rearranged as 



d|c fc (o) pfd 
ci squarefree 



d|cj.(o) p\d 
d squarefree 



p|cfc(a) V ^ V y 7 / 

«•*) n 



p|cfc(a) 

This completes the proof. □ 
Corollary 2. V - V = A. 

Proof. Trivially, one has V — V C A. From Theorem [6j one gets io(a) > 
for all a £ A and thus also A C V — V by [5J Lemma 1]. □ 
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The dual or reciprocal lattice A* of A is 

A* := {y E K n | y ■ x E Z for all a; € A} 

By definition, the denominator q of a point p E QA* is the smallest number 
a E N with ap E A* . This is also the greatest common divisor of the numbers 
a E N with ap E A*, i.e. ap E A* if and only if q | o. 

Theorem 7. T/ie diffraction measure 7 0/ f/ie autocorrelation j of V exists 
and is a positive, pure-point, translation-bounded measure which is concen- 
trated on the set of points in QA* with (k + l)-free denominator and whose 
intensity at a point with such a denominator q is given by 

( 27 ) £Hn~k) II Unk _ 1)2 • 

T/iis measure can also be interpreted as 

d squarefree 

a weak* -convergent sum (in fact, even \\ ■ \\\ oc -convergent sum) of Dirac 
combs. (For lattices A with determinant 7^ 1 the above formulas must be 
divided by the square of det(A) J 

Proof. Let 7 be the autocorrelation of V. As shown in the proof of Theo- 
rem O one has 

W (a) = i{nk) £ ) . 

d=l p|d V ^ 7 

d squarefree 

aSd^A 

So by Theorem [6] and [5, Lemma 1] one obtains 

2 V 1 

w d fc A 



2 \-i 



d squarefree 



Since Hw^aIIioc = O(l) and the coefficient of uj^a is 0(l/d nk ), this sum of 
tempered distributions is convergent in the weak*-topology by Lemma 
2]. By the Poisson summation formula for lattice Dirac combs [5, Eq. (31)], 
its term-by-term Fourier transform is 



««*> £ 3*11(1 2 



d=i fl P \d V P 7 

d squarefree 

= £H0 £ I II „2nk Z 2v nk J U A*/d» > 
d squarefree 

which weak*-converges to the diffraction measure of V, since the Fourier 
transform operator is weak*-continuous. Since ||wA*/d fe llioc = 0(d nk ) and the 
coefficient of i^x*/d k is 0(l/d 2nk ), the weak*-sum is a translation-bounded 
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pure-point measure equal to the pointwise sum of its terms by [5j Lemma 
2] This establishes the series form ()28p for the diffraction spectrum. 

The explicit values of the intensities can now be calculated as follows. 
Let p be a point in QA* with denominator q. We can assume that q is 
(k + l)-free, since otherwise there is no contribution to (|28p at all. The 
terms in (|28p that contribute to the intensity at p are those with d = mq*, 
where q* is the squarefree kernel of q and m G N is squarefree and coprime 
to q. Thus the intensity at p is 



One explicitly sees that 7 above is fully translation invariant, with lattice 
of periods A*, in accordance with Theorem 1 of pQ. Moreover, since the 
action of the group of automorphisms of A*, Aut(A*) ~ GL(n,Z), on QA* 
preserves the denominator, 7 is (A* xi Aut(A*))-symmetric. In particular, 
both 7 and the set V itself are GL(n, Z)-symmetric. 



What has kept the error term large in the argument as we have presented 
it so far is the last term of ©, with \V\ in the exponent of S (in the 
second component of the minimum). This arose from the 0(1) error term 
in ©, when Q was substituted for the inner sum in (|8j). The O(l) error 
term was not even a boundary effect: it was caused solely by lattices whose 
determinants are much larger than the volume of the region in which points 
are being counted. The result of this was to put the burden of keeping the 
last term of © small onto P (which occurs in the first component of the 
minimum), causing an increase in the error due to the tail of the ^-function 
product. Mirsky's idea in [2] and [13] was to show that the terms with some 
di large contribute a negligible amount to ([Sj) and can be discarded before 
the substitution of ([9]) is made. The remaining terms have the individual 
diS so well bounded that the second component of the minimum can take 
over the role of providing a respectable error term, freeing P to be assigned 

2 Note that 7 is even a || ■ ||i oc -convergent sum of Dirac combs. Since convergence with 
respect to the local norm preserves the spectral type, it is thus clear that 7 is a pure-point 
measure; cf. [3] Theorem 8.4]. 




Using the Euler products in (|A3[> and (|22p this simplifies to 




which agrees with (I27p . 



□ 



8. Improving the error terms 
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a much larger value and thus reducing the size of the tail of the (^-function 
product. 

Let r G Z + , let ... , p r G A, and let mi, . . . , m r G N. Define the symbol 
E ( p 1 .]..,'^) as 1 or according to the system of congruences in t G A, 

(29) t + p^emiA (l<i<r), 

being solvable or not. Further, for a positive real number R and a point 
a; G W 1 , let T^a;; /?; p^^^. r ^ denote the number of points t G A such that 

t G B fl (a:) , 
* + Pi £ m «A (1 < i < r) . 

We denote by [mi, . . . , m r ] the least common multiple of mi, . . . , m r . Fur- 
ther, (mi,m.j) denotes the greatest common divisor of m% and rrij. For 
brevity, let c(Z) denote the 1-content of a nonzero point Z G A. 

Lemma 3. The system (|29p o/ congruences is soluble if and only if 

(mi, rrij) | c(pi - pj) (1 < i < j < r) . 
In the case of solubility, the solutions form precisely one residue class 

(mod [mi, . . . , m r ]A) . 

Proof. This is an immediate consequence of \1'6\ Lemma 1] applied to each 
coordinate with respect to a basis of A. □ 

If mi, . . . , m r are pairwise coprime, the last result boils down to the Chi- 
nese Remainder Theorem ()A10p . In fact, only this special case will be needed 
in Theorem [8] below. However, since the subsequent lemmas may be of in- 
dependent interest, we prefer to stick to the general case. 

Lemma 4. 

t(x; r; ';;'::;;:<) = R n v n [p — pJ ]n + o ( — - — „ ) + o(i) , 

y t-i, ,f T j [mi,...,m r J n \ [mi, . . . , m r \ n L J 

where the implied O- constants depend only on A. 

Proof. This is an immediate consequence of Lemma [3] together with (|A9p 
applied to the lattice [mi, . . . , m r ]A. □ 

Lemma 5. 



[mi, . . . , m r \ mi • • • m r 
where K depends only on r,p 1 , . . . ,p r . 

Proof. This follows along the same lines as Lemma 3 of [13] by employing 
Lemma [3] instead of [13, Lemma 1]. □ 

For points p 1 ,...,p r G A and positive real numbers R and a, denote 
by L(x;R;p 1 , . . . ,p r ;a) the cardinality of systems (t, a%, . . . , a r ) of lattice 
points t G A and numbers a±, . . . , a r G N such that 

t G B R (x) , 
t+PjGa^A (l<i<r), 
a± ■ • ■ a r > R a . 
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Lemma 6. 

L{x; R; Pl , . . . ,p r ;a) = O ^R n ^ nk -^ + ^ + ^R n ~ 1+ ^- 

where the implied O- constants depend only on A, k, r,Pi, . . . ,p r . 

Proof. The proof is by induction on r. For r = 1, we can apply Lemma H] to 
obtain 

L(x;R; Pl ;a) = £ 1 = £ t(x;R;* 

t£B R (x) a 1 >R a 
t+ Pl ealK a 1 <((R+\\p 1 \\)/\) 1 / k 
ai >R a 

ai >R a 1 °1 

ai<((iJ+|| J » 1 ||)/A)V* 

= O ^-a(nfc-l)^ + O {RT 1 - 1 log fl) + 0{R l,k ) , 

where there is no middle term when n = 1 and the logarithm in the middle 
term is only needed in the case n = 2, k = 1, when the corresponding har- 
monic series l/ a i™ ^ diverges. In all other cases, these series converge, 
and the middle term can be taken as 0(R n ~ 1 ). Thus the lemma holds for 
r = 1. Assume now that the assertion holds for some r > 1. Let j3 be a pos- 
itive real parameter to be fixed later. Writing a = a\ ■ ■ ■ a r +i, for symmetry 
reasons one has 

L(x;R; Pl ,...,p r+1 ;a) = 0( J] l) + 0( J] 1)=L X + L 2 , 

t£B R (x) teB R (x) 

t+ Pr+1 ea^ +1 A t+ Pr+1 ea* +1 A 

a>R a a>R a 
— — <x$ ai -a r >x^ 

a l a r + l~ 

say. Employing Lemmas [3] and one obtains 

t&B R {x) _R° ! <a<_R' 3 ('-+ 1 )/'- 

t+p r+1 Ga£ +1 A 

_R a <a< J R' 3 ( r + 1 )/ r 

£ / , a},...,a* +1 



Of V - VPi--.Pr + i; + jR n-l + 1 

Offln y i + o(iT-l+£(r+l)/r+e 

V ^ (ai •••a r+ i) nfc / V 

O ( R a - a ( nk - 1 )+ € \ -\- O ( R n - 1 +P( r + 1 )/ r + £ 
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With r denoting the ordinary divisor function, one further obtains 
L2 = 0( E !) = °( E ^(H*+Pr+ill/A) 

teB R (x) t+p r+1 ea k +1 A t&B R (x) 
t+PiSa^A t+PiEa^A 

t+p r Sa*A t+p r ea*A 
ai---a r >x' 3 ai ■■■a r >x^ 

= 0( R ^ = 0(R e L(x;R; Pl ,..., Pr ;f3)) 

teB a (x) 

t+PiSa^A 

t+p r £a k A 
ai---a r >xP 

= O (Rn~P(nk-l)+2e\ + q f^-1+^+2, 

by assumption. Setting /3 := rr J^., 1 , we obtain 

L(a;; J2; Pl , . . . ,p r+1 ; a) = O (V-a(nfc-i)+A + ^n-i+^+2< 
which proves the lemma. □ 

We are now in a position to improve the error term of Lemma [TJ 

Theorem 8. Let V be a finite subset of A, m G N , m € A, P be a natural 
number coprime to m and x G M. n . Then 

\L(V P ;V,<D) n {m + mK)C\B R (x)\ 



is 



^n(i-^)+°(«"- i+ - + ')i 

p\p 

where the O- constant depends only on A, k and V . 

Proof. This follows from the following modification of the proof of Lemma [TJ 
We shall also use the notation from that proof. It suffices to show that ([5]) 
is 

R n v n 



nfc+l 



m nQnk ±1^ ^ p 

To this end, divide ([8]) as C\ + C2, where 

C ^= E E ••• E tib'-dr) E L 

fii|^/Q d 2 \P/Q d r \P/Q teAnB R (x) 
d 1 ---d r <R~^ t&q+rnQ k A 

and C2 consists of the terms with d\ ■ ■ ■ d r > . By Lemma [6j 
C 2 = (R n -^ + ^+0(R n - 1+ ^ + ^ = 0(R n - 1+ ^- 
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One further obtains 

c i = ££■■■£ Mi-dr) E 1 

di\P/Q d 2 \P/Q d r \P/Q teAC\B R (x) 
d\---dr< teq+mQ k A 

= E E E ^...d^R^;;;^ 

di\P/Q d 2 \P/Q d r \P/Q 
i 

dl ■ ■ ■ d r < Rnk 

Since the di,...,d r are pairwise coprime and since (mQ,d) = 1, Lemma 2] 
in conjunction with Lemma [3] shows that 

T ( R .d<t,...,d*,mQ<°\ = Rnv n Q ( \ +ofll 

V ' ' Pi,-*r.« ) m n (dQ) nk ^ \m n - 1 (dQ)( n - 1 ) k J { ' 

m n (dQ) nk ^ 1 ' ' 

Just as in the proof of Lemma[H substituting this in the above expression for 
C\ and removing the condition d < from the sum over R n v n / (m n (dQ) nk ) 
gives the main term 

rntiOnk 1 J. \ n nk 
m ^ P |P/Q V P ' 

The error from the extra terms included in the extended multiple sum is 



°{ Rn E Tj ^)=o{R n - l+ ^) = o{R r ' 



nk + 1 

d nk ) x ~~ ' \~~ 

d>R^ 

Similarly, the sum over the error term can be seen to be 0(R n ~ 1+ ^k +e ). 
Altogether, this proves the assertion. □ 

One can now employ Theorem [8] instead of Lemma [1] to see that the error 
terms in Corollary [1] and Theorem [1] of the form 0(R n /(log R) nk ~ 1 ) can 
indeed be improved to 



0(R 



n—l+ 



nk + 1 



More precisely, Theorem [8] allows one to choose P as large as the product 
of primes less than R l K nk+l "> (instead of log-R) in the modified proofs. 



Appendix A. Facts from number theory 

We have used a number of standard facts from number theory in this 
paper, which we collect here with proper references for convenience. 

The inclusion- exclusion principle says that if we have a set of N elements 
and a list of properties P\, P2, . . ., with N{ elements having property Pi, Nij 
having both properties Pi and Pj, and so on, then the number of elements 
having none of the properties is 



(Al) N-N1-N2 + A12 + A123 B Thm. 260] . 
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The Mobius function is defined for m € N by 
( 



fi(m) := < 



1 when m = 1, 

(— l) r when m is the product of r distinct primes, 
otherwise, 



and has the property that, for any m € N, 

(A2) V n(d) = I 1 lf m = [8, Thm. 263], Thm. 6.3.1] 

otherwise, 

(the basis of the Mobius inversion formula) , derived by applying the inclusion- 
exclusion principle to the singleton set {m} with the property Pj being di- 
visibility by the ith prime. 

The Riemann (^-function is defined for K(s) > 1 by 

(A3) cw==f;^= n 

m=l p prime 

and, as a result of Mobius inversion, 



CO 



£ Mm) = 1 Thm ^ §g 

m=l v ' 

We needed to approximate partial Euler products, slightly more general 
than that on the right of (|A3p . Let r > and s > 1 be fixed. Then 



o^nf 1 -^) = E'^O-i) 

r 

> ^ if N s > r 



p>N P' 

r 



> 

^ ~ 1 p>7V ^ 



-E 1 

1 ^ V s 



> 



r 



Hence, on exponentiating, 



^_l (A r_ 1)s -i 



l-s\ 



( A4) n (i-i =l-0(iV 

We also had to estimate the product of primes up to a given bound: 
(A5) n^ <4JV 18, Thm. 415]. 

p<N 

The r- divisor function r r (m), for r > 2, is the number of ways of expressing 
m as an ordered product of r natural numbers. The special case r = 2 is 
the ordinary divisor function 

r(m) := T2(m) = 1. 

d\m 
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which satisfies r(m) = 0(rrf) for every e > [8, Thm. 315], [91 Thm. 6.5.2], 
from which we deduce that also 

(A6) T r (m) < r(m) r = 0{m e ) 

Another divisor sum estimate we have used is 

(A7) \ = O(loglogm) [8, Thm. 323]. 

d\m 

An arithmetic function f(m) (defined on natural numbers m) is called 
multiplicative if f(m\rri2) = f (rrii) f (1712) whenever (7111,1712) = 1. For ex- 
ample, the functions /z(m) and r r (m) are clearly multiplicative. A Dirichlet 
series, Ylm=i f( m )/ mS i whose coefficients f(m) are multiplicative can be 
expressed as an Euler product over the primes p, 

m=l p ^ ' 

for all values of s for which the sum is absolutely convergent. 
An estimate we have used frequently is 

(A9) IW"*I=S+»((^) M ")+»(1). 

approximating the number of points of an n-dimensional lattice A in a large 
ball B„(x) (the last error term being required only when detA is bigger 
than p n ). This is obtained by dividing B p (x) into fundamental regions 
for A, each of volume det A and containing one point of A, with the error 
terms arising from fundamental regions that overlap the boundary of B p (x). 
The O-constants depend on the shape of A, but not on its size, and are 
independent of x. A more precise version is given as Proposition 1 of [5]. 

We have also made much use of the Chinese Remainder Theorem in the 
form that if m\,rrt2, ■ ■ ■ , m r are points of a lattice A and m\,m2, ■ ■ ■ , m r 
is a set of natural numbers that are pairwise coprime, then there is a point 
t £ A such that for all points x G A 

(A10) x = rrti (mod mjA) for i = 1, . . . ,r <?=^ x = t (mod m\ ■ ■ ■ m r A). 

This is given as Proposition 2 of [5] and is proved by applying Theorem 2.7.2 
of [S] (or Theorem 121 of 0) to each coordinate relative to a basis of A. 

We needed a simple upper bound for binomial coefficients. By comparison 
with J l \ogxdx, we have, for s E N, 

log si > s log s — s + 1 

(a weak, one-sided version of Stirling's formula), so s! > [s/e) s and hence 

/ m\ m(m — 1) ■ ■ ■ (m — s + 1) m s /em\ s 

< A11 > {.)-- — -i - £ ir<(v)- 
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